Theoretical Error Prediction for a Language Identi cation System using Optimal Phoneme Clustering

نویسندگان

  • Kay M. Berkling
  • Etienne Barnard
چکیده

using Optimal Phoneme Clustering Kay M. Berkling, Etienne Barnard (berkling,barnard)@cse.ogi.edu Center for Spoken Language Understanding, Oregon Graduate Institute of Science and Technology Abstract A neural network based language identi cation system is described, which uses language independent phoneme clusters as speech units to recognize the language spoken by native speakers over the telephone. We extend our previous work comparing phoneme-cluster and phoneme based approaches to language identi cation [1]. By creating a new speech unit valid across all languages in a theoreticallymotivatedmanner, we circumvent problems that are associated with ne phonemic modelling such as high complexity [4], extensive training requirements [2], and the linguistically arbitrary reduction to subsets of phonemes [4]. A common set of speech units across languages allows us to automatically derive discriminating sequences of any length and theoretically estimate the language identi cationerror. We demonstrateour implemented system for German vs. English on the OGI-TS database.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Theoretical error prediction for a language identification system using optimal phoneme clustering

using Optimal Phoneme Clustering Kay M. Berkling, Etienne Barnard (berkling,barnard)@cse.ogi.edu Center for Spoken Language Understanding, Oregon Graduate Institute of Science and Technology Abstract A neural network based language identi cation system is described, which uses language independent phoneme clusters as speech units to recognize the language spoken by native speakers over the tele...

متن کامل

Language identification with language-independent acoustic models

In this paper we explore the use of languageindependent acoustic models for language identi cation (LID). The phone sequence output by a single language-independent phone recognizer is rescored with language-dependent phonotactic models approximated by phone bigrams. The language-independent phoneme inventory was obtained by Agglomerative Hierarchical Clustering, using a measure of similarity b...

متن کامل

Fast bootstrapping of LVCSR systems with multilingual phoneme sets

In this paper we described an e cient method to bootstrap continuously spoken, large vocabulary speech recognition systems by multilingual phoneme sets. To evaluate this techniques we collected the multilingual database GlobalPhone which currently consists of 9 di erent languages. A multilingual recognizer (MULTI) based on the four languages German, English, Japanese and Spanish was developed t...

متن کامل

Language identification of six languages based on a common set of broad phonemes

ON A COMMON SET OF BROAD PHONEMES Kay M. Berkling ([email protected]), Etienne Barnard ([email protected]) Center for Spoken Language Understanding, Oregon Graduate Institute of Science and Technology, 20000 N.W. Walker Road, P.O. Box 91000, Portland, OR 97291-1000, USA ABSTRACT We describe a system designed to recognize the language of an utterance spoken by any native speaker over the te...

متن کامل

SECURING INTERPRETABILITY OF FUZZY MODELS FOR MODELING NONLINEAR MIMO SYSTEMS USING A HYBRID OF EVOLUTIONARY ALGORITHMS

In this study, a Multi-Objective Genetic Algorithm (MOGA) is utilized to extract interpretable and compact fuzzy rule bases for modeling nonlinear Multi-input Multi-output (MIMO) systems. In the process of non- linear system identi cation, structure selection, parameter estimation, model performance and model validation are important objectives. Furthermore, se- curing low-level and high-level ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995